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TITLE OF THE INVENTION 
METHOD AND SYSTEM FOR FRACTAL-BASED ANALYSIS OF MEDICAL 

IMAGE TEXTURE 

BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

[0001] The present invention relates to a method, system and computer program for 
computerized assessment of breast cancer risk using fractal-based texture analysis. 
[0002] The present invention also generally relates to automated techniques for the 
detection of abnormal anatomic regions, for example, as disclosed, in particular, in 
one or more of U.S. Patents 4,907,156; 5,133,020; 5,832,103; and 6,138,045; all of 
which are incorporated herein by reference. 

[0003] The present invention also generally relates to computerized techniques for 
automated analysis of digital images, for example, as disclosed in one or more of U.S. 
Patents 4,839,807; 4,841,555; 4,851,984; 4,875,165; 4,918,534; 5,072,384; 5,150,292; 
5,224,177; 5,289,374; 5,319,549; 5,343,390; 5,359,513; 5,452,367; 5,463,548; 
5,491,627; 5,537,485; 5,598,481; 5,622,171; 5,638,458; 5,657,362; 5,666,434; 
5,673,332; 5,668,888; 5,732,697; 5,740,268; 5,790,690; 5,873,824; 5,881,124; 
5,931,780; 5,974,165; 5,982,915; 5,984,870; 5,987,345; 6,011,862; 6,058,322; 
6,067,373; 6,075,878; 6,078,680; 6,088,473; 6,112,112; 6,141,437; 6,185,320; 
6,205,348; 6,240,201; 6,282,305; 6,282,307; 6,317,617 as well as U.S. patent 
applications 08/173,935; 08/398,307 (PCT Publication WO 96/27846); 08/536,149; 
08/900,189; 09/027,468; 09/141,535; 09/471,088; 09/692,218; 09/716,335; 
09/759,333; 09/760,854; 09/773,636; 09/816,217; 09/830,562; 09/818,831; 
09/842,860; 09/860,574; 60/160,790; 60/176,304; 60/329,322; 09/990,311; 
09/990,310; 09/990,377; and 60/331, 995; and PCT patent applications 
PCT/US98/15165; PCT/US98/24933; PCT/US99/03287; PCT/US00/41299; 
PCT/US0 1/00680; PCT/US01/01478 and PCT/US0 1/0 1479, all of which are 
incorporated herein by reference. 

[0004] The present invention includes the use of various technologies referenced and 
described in the above-noted U.S. Patents, as well as described in the references 
identified in the following LIST OF REFERENCES by the author(s) and year of 
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publication and cross-referenced throughout the specification by reference to the 
respective number in parentheses, of the reference: 
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DISCUSSION OF THE BACKGROUND 

[0006] Breast cancer risk assessment provides an opportunity to devise appropriate 
surveillance plans that may include enhanced screening for women at increased risk of 
breast cancer. Computerized analysis of mammographic parenchymal patterns 
provide an objective and quantitative characterization and classification of these 
patterns, which may be associated with breast cancer risk. Computerized assessment 
of breast cancer risk based on the analysis of mammograms alone or combined with 
epidemiologic risk factors (for example, age) may serve as an alternative to current 
existing clinical methods, which are costly and/or information-dependent, in 
predicting breast cancer risk. 

[0007] Breast cancer is the most frequently diagnosed cancer and is the second 
leading cause of death in women with an estimate that 203,500 new invasive cases 
will be diagnosed as malignant and 39,600 women will die from breast cancer in the 
United States in 2002 [1], Studies show that mammography is the most effective 
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imaging technique for the early detection of breast cancer [2, 3] 5 which has potential 
to reduce the breast cancer mortality by as much as 30% [4]. Annual mammography 
has been recommended for women age 40 and older by American Cancer Society [1]. 
There is a need to develop new strategies, methods and systems for identifying breasts 
that are at risk of invasive breast cancer at the earliest stage in the multi-step 
carcinogenesis process. 

[0008] Pattern recognition based on the concept of traditional fractal dimension has 
been applied in many complex systems. Many physical systems in nature are suitable 
to be described by fractal dimensions to show both self similarity and the overall 
roughness at multiple scales [5]. Several studies reported that fractal dimension has 
been useful in characterizing complex biological systems which are similar to 
mammographic patterns [6-8]. All these suggest that fractal based computerized 
radiographic markers may be a useful descriptor to characterize mammographic 
parenchymal patterns. 

[0009] The breast is composed primarily of two components, fibroglandular tissue 
and fatty tissue. The average breast consists of 50% fibroglandular tissue and 50% 
fat. Fibroglandular tissue is a mixture of fibrous connective tissue and the glandular 
epithelial cells that line the ducts of the breast (the parenchyma). The major breast 
diseases develop from the terminal ductal lobular units of the breast, and arise 
predominantly from the epithelial cells that line the ducts; however, the fibrous or 
connective tissue can also be involved. It is thought by most experts that malignant 
breast disease develops through a process that starts with epithelial hyperplasia, i.e., 
an increase in the number of epithelial cells. Epithelial hyperplasia can progress to 
atypical hyperplasia in which the epithelial cells not only increase in number, but also 
change in a way that is not normal for these cells. The process, at this stage, is 
believed to be reversible. Once a certain criterion level of atypia is reached, the 
diagnosis of carcinoma-in-situ can be made, in which there is no invasion of 
malignant cells outside of the duct. The process of malignant transformation is 
considered irreversible at this stage. In the last phase of development, the cancer cells 
break out of the ductal walls and invade the surrounding stromal tissue, and at this 
point the disease is called infiltrating or invasive carcinoma. Most (80%-85%) breast 
carcinomas can be seen on a mammogram as a mass, a cluster of tiny calcifications, or 
a combination of both. Other mammographic abnormalities are of lesser specificity 



and prevalence than masses and/or calcifications, and include skin or nipple changes, 
abnormalities in the axilla, asymmetric density, and architectural distortion. 
[0010] Early detection of breast cancer can improve survival rates. The overall five- 
year survival rate for women diagnosed with breast cancer is 84%, but when found at 
a small, localized stage, the 5 -year survival rate is 97% [9]. Studies show that use of 
screening mammography can reduce lesion size and stage at detection, improving the 
prognosis for survival. Currently, mammography is a well-established imaging 
technique for early detection of breast cancer. Annual screening mammography is 
recommended by the American Cancer Society for all women over the age of 40 [9]. 
[0011] Clinical acquisition of x-ray mammograms is a rather complicated procedure 
and requires specific techniques in order to obtain high quality images. Attenuation 
differences between various structures within the breast contribute to image contrast. 
Due to the similar composition of breast structures and the physical manifestations of 
breast carcinoma, screen-film mammographic imaging must be substantially different 
from general radiographic imaging. Low-energy x-rays are required to enhance the 
ability to differentiate between normal tissues and carcinoma. The radiological 
appearance of the breast varies between individuals because of variations in the 
relative amounts of fatty and fibroglandular tissue. Since fat has a lower effective 
atomic number than that of fibroglandular tissue, there is less x-ray attenuation in fatty 
tissue than in fibroglandular tissue. Fat appears dark (i.e., higher optical density) on a 
mammogram, while fibroglandular tissue appears light (i.e., lower optical density) on 
a mammogram. Regions of brightness associated with fibroglandular tissue are 
normally referred to as "mammographic density." 

[0012] Screening mammography typically includes two standard radiographic 
projections, medio-lateral oblique (MLO) and cranio-caudal (CC), that are taken of 
each breast (right and left) for a total of four images. The purpose of these two views 
is to completely image the breasts and, if any lesions are present, allow localization 
and preliminary characterization. 

[0013] As the best method for early detection of breast cancer, annual screening 
mammography has been recommended for women over 40 years of age [9]. 
Mammographic surveillance for women under age 40 years who are at very high risk of 
developing breast cancer, however, still remains an issue, since the benefit of screening 
women in this age group has not been proven. Women at high risk of developing breast 
cancer tend to develop breast cancer at a younger age [10]. Identification and close 
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follow-up of these high-risk women may provide an opportunity for early breast cancer 
detection. Thus, computerized methods that are capable of assessing breast cancer risk 
may allow women and their physicians to devise an individualized surveillance plan that 
may include enhanced screening for women at high risk for early detection of breast 
cancer. These plans may lead to improvements in the overall efficacy of screening 
mammography for early detection of breast cancer. Further, knowledge of which 
women are at high risk of developing breast cancer has important implications in the 
study of breast cancer. 

[0014] There are two widely used methods to measure risk: relative risk and absolute 
risk [11]. Relative risk is defined as the ratio of age-specific breast cancer incidence 
rate among women with specific risk factors to the incidence rate among women 
without known risk factors. Relative risk estimates are useful for measuring the 
relative magnitude of effect of a given risk factor as a population risk. However, 
relative risk estimates do not directly approximate the underlying probability of a 
diagnosis of breast cancer for an individual over time. 

[0015] Absolute risk (or cumulative risk) is defined as the probability that a woman 
with given risk factors and given age will develop breast cancer over a defined time 
period. Absolute risk estimates give women a realistic and individualized estimate of 
the chance of developing cancer over various time horizons. An assessment of 
cumulative risk over different periods of time can help a woman understand the extent 
of her risk and therefore, can be useful in helping the woman and her doctor define an 
acceptable surveillance plan for the future. 

[0016] For decades, it has been known that all breast cancers are genetic, i.e., the 
development of breast cancer is the result of alteration of chromosomal DNA through 
mutation or damage with the resultant loss of normal growth regulation [12]. 
Sporadic breast cancer results from somatic changes that are specific to the tumor 
cells, i.e., the epithelial cells of the breast, which are not found in other cells of the 
patient. Recent molecular studies demonstrate that breast cancer may be inherited 
[10, 13, 14]. Using genetic linkage analysis to identify a gene named BRCA1 (breast 
cancer 1), was found to be responsible for the breast cancer diagnosed in women who 
inherited a mutated form of the BRCA1 gene in all cells (germline mutation) at birth. 
Since then, four other genes responsible for breast cancer, including the BRCA2 
(breast cancer 2) gene, have been identified [15]. In general, hereditary breast cancer 
appears earlier than purely sporadic breast cancer, because among women with 



inherited susceptibility, one of the cancer-causing mutations is present from birth. 
Thus, fewer somatic mutations specific to breast cancer cells need to occur. 
[0017] It is estimated that women who inherit a mutated form of the BRCA1 gene 
have as much as a 20% risk of developing breast cancer by age 40 years, a 33%-73% 
risk of developing breast cancer by age 50 years, and an 56%-87% risk of developing 
breast cancer by age 70 years [16, 17] which is about up to 8 times higher than the 
lifetime risk for the general population. The recent isolation of BRCA1 and BRCA2, 
and the acknowledgment that additional breast cancer susceptibility genes may exist, 
provides a molecular basis for counseling some high-risk women. 
[0018] Although the evidence of familial aggregation of breast cancer suggests that 
there is an important hereditary component, there are many families in which breast 
cancer (familial breast cancer) has appeared more than once purely by chance and not 
as the result of inherited susceptibility. Studies show that truly hereditary breast 
cancers accounts only for 5%-10% of all breast cancers [18, 19], and most breast 
cancers occur sporadically and are likely the result of random events on the cellular 
level. In addition to age, many factors have been identified to be related to breast 
cancer risk. Although, the basic mechanism underlying the association between 
breast cancer and these risk factors is not well understood. It has been recognized for 
some time that varying levels of endogenous and exogenous estrogens have been 
associated with the risk of developing breast cancer. Higher levels of endogenous 
hormones, in particular estrogens, are an important factor in the etiology of breast 
cancer [20, 21]. 

Table 1 . Selected factors for breast cancer risk. 
Factor Comparison group Approximate 



relative risk 



Age 



40-44 



Age 25-29 



16 



50-54 



28 



60-64 



44 



70-74 



56 



Western country 



Japan 



5 
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Family history of breast cancer 

One affected first-degree relative No affected first-degree 1 .4-3 

relative 

Two or more affected first-degree 4-6 
relatives 

Early age (30 yrs old) of onset in Age 50 2.6 
affected relative 

Reproductive history 

Age at menarche, 1 1 Age 16 1.3 

Age at first live birth 

20-247 <20 1.3 
25-29 1.6 
> 30, nulliparous 1 .9 

Age at menopause 

After 55 Age 45-55 1.5 
Before 45 0.7 

Evidence of breast pathology 

Any benign disease No biopsy or aspiration 1 .5 
Proliferative disease 2 
Atypical hyperplasia 2-4 

History of cancer in contralateral breast No history of cancer 5 

Percent dense parenchyma on 
mammography 

5%-24.9% < 5% dense regions 1 .7 

25%-44.9% 2.5 
45%-64.9% 3.8 
>65% 4.3 



Exposures 
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Radiation, lOOrads 
Alcohol, two drinks/day 



No special exposure 
Non drinker 



3 

1.7 



From Gail MH, Benichou J. Assessing the risk of breast cancer in individuals. 
In deVita VT, Helman S, Rosenberg SA (eds): Cancer Prevention, Philadelphia, JB 
Lippincott; 1992, pp 1-15. 

[0019] Risk factors for breast cancer can be classified broadly as being of either 
personal or environmental origin. Personal risk includes aspects of individual 
biological histories, such as family history of breast cancer, reproductive history, 
menopausal status, and breast disease history. Environmental risk factors are 
exogenous influences, such as diet and exposure to environmental carcinogens. Table 
1 lists selected factors that have a strong or well-established association with breast 
cancer. These factors were identified on the basis of large epidemiologic studies [22]. 
[0020] Among these risk factors, age has been identified as the single most important 
risk factor for the development of breast cancer in women. The incidence of breast 
cancer increases with age. Studies show that diagnosis of breast cancer is rare before 
age 25 years [23]. The incidence of breast cancer increases rapidly between the ages 
of 25 and 44. Near the age of menopause, the rate of increase in incidence for 
successive age groups is slower compared with the observations in premenopausal 
women. In addition to age, risk factors such as family history of breast cancer, 
personal history of breast cancer, biopsy-confirmed benign proliferative breast 
disease, and age at first live birth and at menarche have been identified and have been 
used in clinical risk prediction models [11, 22, 24] to estimate an individual's risk of 
developing breast cancer. 

[0021] Increased mammographic density is another factor that has been found to be 
associated with an increased risk of breast cancer. It has been shown in several 
studies that women with increased mammographic parenchymal density are at a four- 
to six-fold higher risk over women with primarily fatty breasts [25-30]. At present, 
the reason for this increased risk is unclear. One possibility is that increased density 
reflects a larger amount of tissue at risk for developing breast cancer. Since most 
breast cancers develop from the epithelial cells that line the ducts of the breast, having 
more of this tissue as reflected by increased mammographic density may increase the 
chances of developing breast cancer. 
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[0022] The Gail and Claus models are used to estimate individual risk over a woman's 
lifetime (up to 79 years old) and during the next 10 years of her lifetime, which are 
referred to as the lifetime risk and the 10-year risk of developing breast cancer. 
[0023] The Gail model [31] was developed based on case-control studies involving 
2,852 white women with incident breast cancer and 3,146 white controls selected 
from the Breast Cancer Detection Demonstration Project (BCDDP) population data. 
The risk factors used in the Gail model are age, age at menarche, age at first live birth, 
number of previous breast biopsies, number of first-degree relatives with breast cancer 
and history of biopsy with hyperplasia [22, 31]. These risk factors are broadly 
consistent with those selected from other large population-based studies [22]. 
Because the Gail model was developed from a database which includes only white 
women who tend to return for annual mammographic screening [22], it is anticipated 
that this model would overpredict risk in younger, unscreened women since the 
BCDDP population had a higher prevalence of women with adverse risk factors than 
the general population [22, 31]. 

[0024] The Claus model [24] was derived from the Cancer and Steroid Hormone 
(CASH) Study, which was a multicenter, population-based, case-control study. The 
data consists of 4730 patients with histologically confirmed breast cancer, age 20-54 
years, and 4688 control subjects. The control subjects were frequency-matched to 
patients according to geographic region and 5 -year categories of age. The aim of the 
study conducted by Claus et al. differs from that of Gail et al. in that Claus et al. 
intended to address the issue of risk calculation solely for a subset of women who are 
at potentially high risk for breast cancer, i.e., women with a family history of breast 
cancer. For these women, it appears that the number and the type of relatives affected 
with breast cancer as well as the ages at onset of any affected relative may be the most 
important risk factors, more so than risk factors such as age at first live birth or age at 
menopause that are used in the Gail model. Claus et al. found in their data that risk of 
individuals increased as "age at onset" of their affected relatives decreased [24] . On 
the other hand, Gail et al. did not find, in their data, that age at onset was helpful in 
the prediction of risk once the number of relatives affected was considered [22, 31]. 
[0025] Because the risk factors used in the Gail model are more consistent with those 
selected from other studies, the Gail model was able to be validated on other large 
databases. Validation studies [32, 33] have shown that the Gail model predicts risk 
most accurately in women who undergo yearly mammographic screening and 
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overpredicts risk for women who do not undergo yearly mammographic screening. 
Another validation study, which involved 109,413 women from the Nurses' Health 
Study, showed that the correlation coefficient between observed risk from the 
database and predicted risk from the Gail model was 0.67 [33]. These validation 
studies demonstrated that, for accurate estimation, the Gail and Claus models should 
be applied only to a population similar to those from which the models were derived. 
[0026] With the increasing awareness of breast cancer risk and the benefit of 
screening mammography, more women in all risk categories are seeking information 
regarding their individual breast cancer risk. The need exists for primary care 
clinicians to be able to assess an individual's risk of developing breast cancer and 
offer an appropriate surveillance program for each individual [34, 35]. Identification 
and close surveillance of women who are at high risk of developing breast cancer may 
provide an opportunity for early cancer detection. 

[0027] Breast cancer risk assessment is an emerging service which includes 
determination of risk, recommendations for surveillance, and counseling for women at 
elevated risk. Currently, several prediction models based on large epidemiologic 
studies [11] have been developed to predict risk using known risk factors such as a 
woman's age, her family and personal histories of breast cancer, and gynecological 
information. Among them, the Gail model and the Claus model are the most 
commonly used for prediction of an individual's breast cancer risk [34]. These 
models are used by clinicians for counseling women who are seeking information 
regarding their individual breast cancer risk. The Gail model was used to identify 
women at high risk for the entry to the Tamoxifen Prevention Trial. Recently, Offit 
and Brown [11] reviewed four major models of risk prediction and provided a 
comparison of the different models. Since each of these models was derived with a 
different study design and used different factors to calculate risk, risk estimates for a 
given individual obtained from each of the models differed slightly. It was anticipated 
and confirmed that these models, which use a few selected risk factors, only predict 
risk accurately for the populations similar to those from which the models were 
developed [22, 24, 31, 32, 33, 36]. Clinicians have been instructed to select models 
carefully since each of these models was designed based on a particular population. 
Further, the risk predicted from these models must be justified according to clinical 
observations since information such as a positive result from a DNA test for the 
BRCA1 IBRCA2 -mutation supersedes routine projections from a model [34, 36]. 
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Nevertheless, the models provide an epidemiologic basis for risk prediction and serve 
as guidelines for counseling patients until more refined predictions based on 
molecular characterization or other methods become available. 
[0028] Over the past twenty years, the association of breast cancer risk with 
mammographic parenchymal patterns has been investigated. In 1976, Wolfe first 
described an association between risk for breast cancer and different mammographic 
patterns [37]. He described four patterns of breast parenchyma (N 1 , PI, P2, and DY) 
associated with different risk levels of developing breast cancer. An Nl (lowest risk) 
pattern indicates a breast in which the breast is composed entirely of fat tissue. PI 
(high risk) and P2 (high risk) patterns refer to increasing ductal prominence (a PI 
pattern consists of ducts occupying less than 25% of the breast and a P2 pattern 
consists of ducts occupying more than 25% of the breast). A DY pattern (highest risk) 
refers to a breast which is largely occupied by diffuse or nodular densities. Many 
investigators have used Wolfe patterns to classify the mammographic appearance of 
breast parenchyma for risk prediction [38]. Others have used qualitative or 
quantitative estimates of the proportion of the breast area (percent density) that 
mammographically appears dense to assess the associated breast cancer risk. 
[0029] Since Wolfe's work, interest in the possible association of mammographic 
parenchymal patterns with breast cancer has varied [39-41]. Wolfe's initial reports 
were landmark studies in this field, however, the results provoked various criticisms, 
for example, possible bias in the results due to the "masking" effect. Studies showed 
that breast cancer was most easily detected by mammography in fatty breasts and was 
most difficult to detect in breasts with dense parenchyma, thus there were more 
cancers missed by mammography in women with dense breasts [42]. The hypothesis 
of the "masking effect" [39] said that the observed greater risk of breast cancer in 
women with dense breasts was due to the fact that these missed cancers in the dense 
breast at the initial classification declared themselves on subsequent follow-up. 
[0030] Several groups [28, 43] have conducted experiments to examine the masking 
hypothesis. Whitehead et al. [43] examined the masking hypothesis by using data 
from the Breast Cancer Detection and Demonstration Project (BCDDP). They found 
that the masking of cancer did occur in breasts with dense parenchyma; however, their 
results showed that the effect of the masking on estimation of breast cancer risk was 
small. They concluded that women with dense breasts have two disadvantages: 1) 
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they were at increased risk of developing breast cancer, and 2) cancers occurring in 
dense breast parenchyma were more difficult to detect. 
[0031] During the time of this controversy, many investigators studied the 
relationship between the mammographic patterns and breast cancer risk using the 
Wolfe method or percent density methods. Considerable variations were observed in 
reported results. In 1992, Warner et al. [38] carried out a meta-analysis using 35 
publications to examine the effect of different methods on the assessment of breast 
cancer risk. They grouped the studies according to their designs and methods used, 
and determined the magnitude of the risk of breast cancer associated with 
mammographic density for the studies in each group. They found that the estimated 
relative risk of developing breast cancer depended on the methods that were used to 
classify mammographic patterns and ranged from 0.53 to 5.19. Based on the meta- 
analysis, they concluded that women with dense breasts have an increased risk of 
breast cancer relative to those with fatty breasts. 

[0032] While visual assessment of mammographic patterns has remained 
controversial due to the subjective nature of human assessment [44], computer vision 
methods can yield objective measures of breast density patterns. Computerized 
techniques have been investigated to quantitatively evaluate mammographic 
parenchyma and identify women that are at risk of developing breast cancer. 
Computerized density analysis of mammographic images has been investigated by 
various investigators including Magnin et al. [45], Caldwell et al. [46], Taylor et al. 
[47], Tahoces et al. [48], and Byng et al. [49, 50]. 

[0033] Magnin et al. [45] tried to classify mammograms into four categories (Wolfe 
patterns) using texture parameters extracted from co-occurrence matrices, the spatial 
gray level dependence method (SGLDM), and the gray level difference method 
(GLDM). They claimed that their result was inconclusive because a limited number 
of cases (27 mammograms) were used and the quality of the images used in the study 
was poor [37]. Caldwell et al. [46] used fractal dimension analysis to classify 
mammograms into the four patterns described by Wolfe, yielding 84% agreement ^ 
with that of radiologists. Tahoces et al. [48] investigated the ability of linear 
discriminant analysis to quantify Wolfe patterns by merging texture measures 
obtained from Fourier transform method, local contrast analysis, and gray-level 
distribution. Their results showed that agreement (22%-77%) among radiologists and 
the computer classification varied depending on the Wolfe patterns. Taylor et al. [47] 
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used a local skewness measure to separate fatty and dense breasts, yielding 85% 
classification accuracy for 106 mammograms. Byng et al. [49, 51] investigated a 
semi-automated interactive thresholding technique based on visual assessment and 
computerized texture analysis (a local skewness measure and fractal dimension 
analysis) to quantify the percent density of breasts. Their results showed that 
computerized assessment of mammographic density using the texture measures (R = - 
0.60) correlated well with the visual assessment (subjective classification) of the 
projected area of mammographically dense tissue. Furthermore, they showed that 
increased mammographic density was associated with an increased relative risk by a 
factor of 2 to 4. Their results also showed that the relative risk estimates obtained 
using the two computer-extracted texture measures were not as strong as those from 
their subjective mammographic classification method. 

[0034] A conventional method for discriminating a high risk group of women from a 
low risk group is illustrated in Figures 1 and 2. Figure 1 shows an original breast 
image 1 which is taken for example with an x-ray device. From this medical image, a 
region of interest 3 is selected for a detailed analysis. The selected region of interest 
has a size of 256 by 256 pixels. Figure 2 shows a diagram in which the medical 
image data is acquired in step 1. In step 3, the medical image data is digitized. 
Further, in step 5, the region of interest is selected for analysis. In step 7, the region 
of interest selected in step 5 is computer analyzed and various computer-extracted 
texture features are obtained. In step 9 linear discriminant analysis is applied to the 
computer-extracted features and a marker is calculated based on these features. Based 
on this marker, in step 1 1 , a group of women are discriminated in high and low risk 
groups. However, the conventional methods known do not produce an accurate 
discrimination for the two groups. 

[0035] Therefore, development of a better and more accurate computerized method to 
automatically extract features that characterize mammographic parenchymal patterns 
and relate to breast cancer risk would benefit women seeking information regarding 
their individual breast cancer risk. 

SUMMARY OF THE INVENTION 
[0036] Accordingly, an object of this invention is to provide a method and system for 
obtaining a fractal characteristic based on computerized fractal-based analysis of a 
structure as presented on a medical image. 
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[0037] Another object of this invention is to provide a method and system for the 
computerized fractal-based analysis of parenchymal structure as presented on a 
medical image and relate the analysis to risk of breast cancer. 
[0038] Another object of this invention is to provide a method and system for 
extracting information from fractal-based texture analyses. 

[0039] These and other objects are achieved according to the invention by providing a 
new and improved method and system for the analysis of structure as presented as 
texture on a medical image. Specific applications are given for the analysis of regions 
within the mammographic breast. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0040] A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better understood by 
reference to the following detailed description when considered in connection with the 
accompanying drawings, wherein: 

[0041] Figure 1 is an illustration of a medical image of a breast and a region of 
interest of the medical image; 

[0042] Figure 2 is a schematic diagram illustrating a conventional method for analysis 
of a structure as presented on the medical image; 

[0043] Figure 3 is a schematic diagram illustrating a novel method for analysis of the 
structure presented on the medical image according with the present invention; 
[0044] Figure 4 is a graph showing a relationship between a log area and a log of a 
relative length from a surface area fractal analysis of the region of interest; and 
[0045] Figure 5 is a graph showing a relationship between the normalized volume and 
a scale from a fractal analysis of the region of interest. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0046] Referring now to the drawings, and more particularly to Figure 3 thereof, a 
schematic diagram of the fractal-based analysis of a texture of a region of interest as 
presented on a medical image is shown. In this embodiment, the aim is to extract the 
multi-fractal characteristics of the structure (from the parenchymal pattern) using 
computer analysis of fractal data. Step 1 in Figure 3 shows an initial acquisition of a 
medical image of a breast from a film or any other means used for obtaining 
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radiographic image data of the breast. A digitization of the medical image is 
accomplished in step 3. Further, in step 5, a region of interest is selected for a further 
analysis. In step 7, the region of interest selected in step 5 is computer analyzed and 
fractal-based computer-extracted texture features are obtained. In step 9 a linear 
discriminant analysis is applied to the fractal-based computer-extracted features and a 
marker is calculated based on these features. Further, based on this marker, in step 11, 
a group of women are discriminated in high and low risk groups. The robustness of 
the results produced by this novel method was tested against a database. 
[0047] The database used for running and evaluating the method of this embodiment 
contains 30 BRCA1 and BRCA2 gene-mutation carriers and 60 low-risk women 
which were randomly selected and age-matched at 5-year intervals with these gene 
carriers [52]. Mammograms were digitized in step 3 at 0.1 -mm pixel size and 10-bit 
gray level. Regions of interest (ROIs), 256 by 256 pixels in size [52], were manually 
selected in step 1 of Figure 3 from a central breast region immediately behind the 
nipple [53]. 

[0048] Four approaches based on fractal analysis were performed. The first method 
used the conventional box-counting technique [54], 



where A(s) is the surface area of the ROI at effective pixel size 8 and ij(x,y) is the gray 
level at location (x,y) in the digitized image data. A(s) is calculated using equation (1) 
for various pixel sizes e. For a two-dimensional mammogram, the fractal dimension 
is defined by equation (2), 

D BC =2-limM^l. (2) 
s-+q log[e] 

[0049] The second method used a modified box-counting technique. Linear 
discriminant analysis (LDA) was applied on surface areas A(e) calculated at multiple 
pixel sizes e. The output from LDA yielded a new class of radiographic markers that 
are similar to the fractal dimension extracted using equation (2). 
[0050] Figure 4 shows the points used to calculate the fractal dimension BClda and 
the fractal characteristic based on the second method. 
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[0051] The third method used the global Minkowski dimension, DM(f), calculated 
using 



D M [f] = lim 

8-»0 



log(Vs) 



(3) 



where for a structuring element g at scale e, V g (s) is the "volume" between two 
processed versions of f obtained using morphological operators. The volume V g (e) is 
computed by 



where (f © eg) and (f ® eg ) are the dilated version and the eroded version, 
respectively, of the image obtained using the structuring element g at scale 8. Note 
that V (e) is the volume arising from the difference between the dilated and eroded 



log(l/e) gives the estimated fractal dimension as shown in Figure 5. A squared 
structuring element of 3x3 pixels was used. 

[0052] The fourth method used a modified Minkowski dimension that is similar to the 
second method. LDA was applied on volume V g (e) calculated at multiple scales. The 

output from LDA yielded a new class of radiographic markers that are similar to the 
fractal dimension extracted using equation (3). 

[0053] For comparing the accuracy of the computerized radiographic markers 
discussed above as indicators of risk, receiver operating characteristics (ROC) 
analysis [55] was used to evaluate the performance of the methods in the task of 
distinguishing between the BRCA1/BRCA2 gene-mutation carriers and low-risk 
group. The leave-one-out method (round-robin method) [56], was used in the 
performance evaluation for the LDA approaches. 



64 64 



V g (e)=X£{(f©eg)-(f<g>eg)} 



(4) 



surfaces. Finding the slope of the least-square fitted line between log[V g (e)/e ] and 



-20- 



[0054] Note that with both of these fractal based techniques, one is required to 
determine a slope as in Figure 5 or multiple slopes as in Figure 4, if the texture is 
multi-fractal in nature. This may be difficult due to the number of limited data points 
used in determining the slope (see Figures 4 and 5). However, the method presented 
in this embodiment incorporates a classifier (such as linear discriminant analysis or an 
artificial neural network) to determine the fractal nature of the texture and relate it to 
breast cancer risk, which make the results more accurate and reliable. 
[0055] The calculated fractal dimension shown in Figure 4 is based on the box- 
counting model, which uses the slope of the regression line between the logarithm of 
surface areas, log|/l(e)], and the logarithm of various pixel sizes, log[e], should yield a 
linear line for a fractal object. Plots of log[^4(e)] versus log[e] in Figure 4 show two 
linear components - a high frequency component and a low frequency component 
suggesting a multi-fractal phenomenon for the radiographic images. The fractal 
dimensions, D BC , were calculated using all data points, the high frequency 
components only and the low frequency components only. The results are shown in 
Table 2. Likewise, results are shown in Table 2 for the global Minkowski dimension 
and the LDA methods for both box-counting and Minkowski methods. The index 
T 

0.90 & z is used to indicate the performance above 90% sensitivity [57]. 



Table 2. Performance in differentiating between the two groups with fractal 
dimensions extracted using the four approaches. ROC results are from the round- 
robin method for the LDA approaches. 



Feature 


A z 


0.90A T Z 


D gc (All data points) 


0.77| 0.05 


0.34| 0.10 


D BC (High frequency component) 


0.861 0.04 


0.31 1 0.15 


D BC (Low frequency component) 


0.58| 0.06 


0.14| 0.06 


LDA BC (All data points) 


0.891 0.04 


0.491 0.15 


LDA BC (High frequency component) 


0.91 1 0.03 


0.561 0.14 


LDA BC (Low frequency component) 


0.551 0.06 


0.101 0.05 


DM[f] 


0.84.1 0.05 


0.261 0.13 


LDA M 


0.91 1 0.03 


0.551 0.14 
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[0056] A multi-fractal phenomenon in the parenchymal texture was observed using 
the method of this embodiment on normal digitized screen/film mammograms. These 
results extend prior application of conventional box-counting and Minkowski 
techniques by incorporating linear discriminant analysis to assess the multi-fractal 
characteristics. The use of fractal analysis to characterize mammographic 
parenchymal patterns yield radiographic markers for assessing breast cancer risk in 
patients. 

[0057] As disclosed in cross-referenced pending patent application 09/773,636, 
Figure 9 of that patent application is a schematic illustration of a general purpose 
computer 900 which can be programmed according to the teachings of the present 
invention. In Figure 9 of the cross-referenced application 09/773,636, the computer 
900 can be used to implement the processes of the present invention, wherein the 
computer includes, for example, a display device 902 (e.g., a touch screen monitor 
with a touch-screen interface, etc.), a keyboard 904, a pointing device 906, a mouse 
pad or digitizing pad 908, a hard disk 910, or other fixed, high density media drives, 
connected using an appropriate device bus (e.g., a SCSI bus, an Enhanced IDE bus, an 
Ultra DMA bus, a PCI bus, etc.), a floppy drive 912, a tape or CD ROM drive 914 
with tape or CD media 916, or other removable media devices, such as magneto- 
optical media, etc., and a mother board 918. The mother board 918 includes, for 
example, a processor 920, a RAM 922, and a ROM 924 (e.g., DRAM, ROM, 
EPROM, EEPROM, SRAM, SDRAM, and Flash RAM, etc.), I/O ports 926 which 
may be used to couple to an image acquisition device and optional special purpose 
logic devices (e.g., ASICs, etc.) or configurable logic devices (e.g., GAL and re- 
programmable FPGA) 928 for performing specialized hardware/software functions, 
such as sound processing, image processing, signal processing, neural network 
processing, automated classification, etc., a microphone 930, and a speaker or 
speakers 932. 

[0058] As stated above, the system of the present invention includes at least one 
computer readable medium. Examples of computer readable media are compact discs, 
hard disks, floppy disks, tape, magneto-optical disks, PROMs (EPROM, EEPROM, 
Flash EPROM), DRAM, SRAM, SDRAM, etc. Stored on any one or on a 
combination of computer readable media, the present invention includes software for 
controlling both the hardware of the computer and for enabling the computer to 
interact with a human user. Such software may include, but is not limited to, device 
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drivers, operating systems and user applications, such as development tools. Such 
computer readable media further includes the computer program product of the 
present invention for performing any of the processes according to the present 
invention, described above. The computer code devices of the present invention can 
be any interpreted or executable code mechanism, including but not limited to scripts, 
interpreters, dynamic link libraries, Java classes, and complete executable programs, 
etc. 

[0059] The programming of general purpose computer 900 (disclosed in cross- 
referenced pending patent application 09/773,636) may include a software module for 
digitizing and storing images obtained from film or an image acquisition device. 
Alternatively, the present invention can also be implemented to process digital data 
derived from images obtained by other means, such as a picture archive 
communication system (PACS). In other words, the digital images being processed 
may be in existence in digital form and need not be converted to digital form in 
practicing the invention. 

[0060] Accordingly, the mechanisms and processes set forth in the present description 
may be implemented using a conventional general purpose microprocessor or 
computer programmed according to the teachings in the present specification, as will 
be appreciated by those skilled in the relevant art(s). Appropriate software coding can 
readily be prepared by skilled programmers based on the teachings of the present 
disclosure, as will also be apparent to those skilled in the relevant art(s). However, as 
will be readily apparent to those skilled in the art, the present invention also may be 
implemented by the preparation of application-specific integrated circuits or by 
interconnecting an appropriate network of conventional component circuits. 
[0061] The present invention thus also includes a computer-based product which may 
be hosted on a storage medium and include instructions which can be used to program 
a general purpose microprocessor or computer to perform processes in accordance 
with the present invention. This storage medium can include, but is not limited to, 
any type of disk including floppy disks, optical disks, CD-ROMs, magneto-optical 
disks, ROMs, RAMs, EPROMs, EEPROMs, flash memory, magnetic or optical cards, 
or any type of media suitable for storing electronic instructions. 

[0062] Numerous modifications and variations of the present invention are possible in 
light of the above teachings. It is therefore to be understood that within the scope of 
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the appended claims, the invention may be practiced otherwise than as specifically 
described herein. 
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